Specify an arbitrary project name. This will be used in textual
(prose) descriptions of the project.

Project name

Nissan 350z



< Prev Next > 
Data Sources



Specify a variety of data feeds from which we will harvest our
data. These sources will be used during both training and 
production. 

Project name 



gaz/350z-train 



Add.. 



Delete 



For maximum coverage in our final reports, we wish to harvest 
from as many different engines and forums as possible. 
Describe the product category that this project focuses on by
entering a set of phrases which identify, describe, or are 
associated with the type of product. Enter one phrase per line. 



sports car 



Do not enter specific brand names here. You will be asked to 
do that later. 



< Prev Next > 



Exit 
Customer



Enter a set of phrases that name the customer and their product 



nissan 
350z 
350 
z 
Enter a set of phrases that name competing companies and
branded products relevant to this project. Enter one phrase 
per line. 



honda s2000 
corvette 
bmw 325i 



< Prev 



Next > 



Exit 
Counter-Examples



Enter some phrases which indicate that a message is not 
related to the key concept, in spite of any superficial 
similarity. 



< Prev 



Next > 



Exit 



FIG. 6 



Title: METHOD FOR DEVELOPING A CLASSIFIER FOR 

CLASSIFYING COMMUNICATIONS 

Inventor: Nigam et al. 

S/N: [new non-provisional application] 

Filed: March 16, 2004 

Docket No: BL055-GN016 


















BUFFET



Overview Labeling Criteria 



1. Project Info 

2. Data Sources 

3. Product Category 

4. Customer 

5. Competitors 

6. Counter-examples 

7. Criteria Questionna.. 

8. Labeling Criteria 

9. Criteria Document 

10. Load messages 

11. Label messages

12. Compute expecte... 

13. Expected perfor... 

14. Done 



Edit the labeling criteria that were derived from your 
questionnaire, and from other additions you have made 



Opinions or comparisons about the product itself are 
Descriptions or usage statements about the product it 
Opinions or comparisons about the products price are 
Opinions or comparisons about a feature of the produ 
Descriptions of usage statements about a feature of th 
Descriptions, discussions and opinions of a news artic 
Mere mentions of a news article mentioning the prod 



Add.. 



Edit..



Delete 



Please make an effort to add at least several key words to 
each criteria element. This helps BUFFET operationalize your 
criteria. 



< Prev Next > 



Exit 
Criteria Document



This creates a human-readable criteria document. Enter a 
filename, and we will write the document to it. 



/home/knigam/silly2.txt 



Select 



< Prev 



Next > 



Exit 
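
As a rough illustration of the Criteria Document step, the sketch below writes a list of criteria to a plain-text file. The layout of the real document is not shown in the figures, so the function name `write_criteria_document`, the header line, and the numbering are assumptions for this example.

```python
from pathlib import Path


def write_criteria_document(path, project_name, criteria):
    """Write a human-readable criteria document and return its text.

    The format (header line, blank line, numbered criteria) is a
    hypothetical layout, not the one used by the tool in the figures.
    """
    lines = [f"Labeling criteria for project: {project_name}", ""]
    lines += [f"{i}. {c}" for i, c in enumerate(criteria, start=1)]
    text = "\n".join(lines) + "\n"
    Path(path).write_text(text, encoding="utf-8")
    return text
```

A caller would pass the filename chosen on this screen together with the criteria edited on the Labeling Criteria screen.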
At this time, you are encouraged to stop and apply tags to
more harvested messages. Ideally, you should label several
hundred messages to get the best results.



266 messages tagged | Label Messages 



(You can always return to this screen later and tag more 
messages.) 



Analyst Workbench



File Import Benchmark Target Polarity Topic Phrase 



Exploration 



Polarity 



Summary | Keywords | Suggestions | Properties



Project: Pocket PC 



Harvest: 9320 messages from 229 queries. 
Benchmark: 400 messages labeled: Quality: 0.836; Consistency: 1.000 
Target: 1000 messages labeled: Quality: 0.844; Consistency: 0.925 
Polarity: unknown 
Topics: unknown 



Publish Configuration 



FIG. 11
Analyst Workbench



File Import Benchmark Target Polarity Topic Phrase 



Project 


Import 


Benchmark 


Target 


Polarity 


Topic 


Phrase 




Summary


Label 


Performance 





Subject: Brighthand reviews the Sony CLIE PEG-NZ90 
Date: Tue Feb 04 00:00:00 EST 2003 
From: Covert 



Engine: discussion.brighthand.com
Forum: Reviews 



+> 
+> 
+> 
+> 

+>Originally posted by hepv 

+> 

+> Anyway... how come we don't have any [illegible]

manufactures making these cool multimedia centric devices (niche).

We did - they were Casio. Casio pioneered PDA multimedia with its Palm PCs and first [illegible] but now they're
agree that someone needs to step up and release a [illegible] that has added multimedia value over other [illegible]. I

39xx screen, old E-125/Maestro joypad, removable battery, Zayo speed, etc.), I'd buy it.



Covert of www.cghm.Bk.com 



View Raw Document View Louie 



Labelling doc #5 (Unlabelled) 



FIG. 12 



